Learning with Multiple Similarities Learning with Multiple Similarities
نویسندگان
چکیده
Title of dissertation: LEARNINGWITHMULTIPLE SIMILARITIES Abhishek Kumar, Doctor of Philosophy, 2013 Dissertation directed by: Professor Hal Daumé III Department of Computer Science The notion of similarities between data points is central to many classification and clustering algorithms. We often encounter situations when there are more than one set of pairwise similarity graphs between objects, either arising from different measures of similarity between objects or from a single similarity measure defined on multiple data representations, or a combination of these. Such examples can be found in various applications in computer vision, natural language processing and computational biology. Combining information from these multiple sources is often beneficial in learning meaningful concepts from data. This dissertation proposes novel methods to effectively fuse information from these multiple similarity graphs, targeted towards two fundamental tasks in machine learning classification and clustering. In particular, I propose two models for learning spectral embedding from multiple similarity graphs using ideas from co-training and co-regularization. Further, I propose a novel approach to the problem of multiple kernel learning (MKL), converting it to a more familiar problem of binary classification in a transformed space. The proposed MKL approach learns a “good” linear combination of base kernels by optimizing a quality criterion that is justified both empirically and theoretically. The ideas of the proposed MKL method are also extended to learning nonlinear combinations of kernels, in particular, polynomial kernel combination and more general nonlinear kernel combination using random forests. Learning with Multiple Similarities
منابع مشابه
Efficient Similarity Derived from Kernel-Based Transition Probability
Semi-supervised learning effectively integrates labeled and unlabeled samples for classification, and most of the methods are founded on the pair-wise similarities between the samples. In this paper, we propose methods to construct similarities from the probabilistic viewpoint, whilst the similarities have so far been formulated in a heuristic manner such as by k-NN. We first propose the kernel...
متن کاملKernel-based transition probability toward similarity measure for semi-supervised learning
For improving the classification performance on the cheap, it is necessary to exploit both labeled and unlabeled samples by applying semi-supervised learning methods, most of which are built upon the pairwise similarities between the samples. While the similarities have so far been formulated in a heuristic manner such as by k-NN, we propose methods to construct similarities from the probabilis...
متن کاملTransfer Learning with Multiple Sources via Consensus Regularized Autoencoders
Knowledge transfer from multiple source domains to a target domain is crucial in transfer learning. Most existing methods are focused on learning weights for different domains based on the similarities between each source domain and the target domain or learning more precise classifiers from the source domain data jointly by maximizing their consensus of predictions on the target domain data. H...
متن کاملInvestigate Diagnostic validity of the third edition of the new Wadkock-Johnson Cognitive Ability Scale in Learning Disabled Students in Ahvaz city
The purpose of this study was to investigate Learning disability diagnostic validation by Woodcock-Johnson III Tests of Cognitive Abilities in Ahvaz city. Statistical Society this study includes all male and female students with learning disabilities from the first to fifth grade of elementary school in Ahvaz. In the academic year 2012-2013, from the state and non-governmental centers, the indi...
متن کاملCycle Time Optimization of Processes Using an Entropy-Based Learning for Task Allocation
Cycle time optimization could be one of the great challenges in business process management. Although there is much research on this subject, task similarities have been paid little attention. In this paper, a new approach is proposed to optimize cycle time by minimizing entropy of work lists in resource allocation while keeping workloads balanced. The idea of the entropy of work lists comes fr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013